Density guided importance sampling: application to a reduced model of protein folding

نویسندگان

  • Geraint Thomas
  • Richard B. Sessions
  • Martin J. Parker
چکیده

MOTIVATION Monte Carlo methods are the most effective means of exploring the energy landscapes of protein folding. The rugged topography of folding energy landscapes causes sampling inefficiencies however, particularly at low, physiological temperatures. RESULTS A hybrid Monte Carlo method, termed density guided importance sampling (DGIS), is presented that overcomes these sampling inefficiencies. The method is shown to be highly accurate and efficient in determining Boltzmann weighted structural metrics of a discrete off-lattice protein model. In comparison to the Metropolis Monte Carlo method, and the hybrid Monte Carlo methods, jump-walking, smart-walking and replica-exchange, the DGIS method is shown to be more efficient, requiring no parameter optimization. The method guides the simulation towards under-sampled regions of the energy spectrum and recognizes when equilibrium has been reached, avoiding arbitrary and excessively long simulation times. AVAILABILITY Fortran code available from authors upon request. CONTACT [email protected].

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Effective Sampling of Protein Conformation Space: Identification of Independent Folding Units

A simplified off-lattice model of protein molecules has been developed which uses a physico-chemical force field based on experimental data. Employing a search strategy based on Simulated Annealing, we are able to predict a high percentage of correct folds for proteins up to 40 residues in length. We have developed a robust and efficient hybrid Monte Carlo algorithm (Density Guided Importance S...

متن کامل

Protein Stability, Folding, Disaggregation and Etiology of Conformational Malfunctions

Estimation of protein stability is important for many reasons: first providing an understanding of the basic thermodynamics of the process of folding, protein engineering, and protein stability plays important role in biotechnology especially in food and protein drug design. Today, proteins are used in many branches, including industrial processes, pharmaceutical industry, and medical fields. A...

متن کامل

Enhancing Learning from Imbalanced Classes via Data Preprocessing: A Data-Driven Application in Metabolomics Data Mining

This paper presents a data mining application in metabolomics. It aims at building an enhanced machine learning classifier that can be used for diagnosing cachexia syndrome and identifying its involved biomarkers. To achieve this goal, a data-driven analysis is carried out using a public dataset consisting of 1H-NMR metabolite profile. This dataset suffers from the problem of imbalanced classes...

متن کامل

Protein Folding Simulations Combining Self-Guided Langevin Dynamics and Temperature-Based Replica Exchange.

Computer simulations are increasingly being used to predict thermodynamic observables for folding small proteins. Key to continued progress in this area is the development of algorithms that accelerate conformational sampling. Temperature-based replica exchange (ReX) is a commonly used protocol whereby simulations at several temperatures are simultaneously performed and temperatures are exchang...

متن کامل

A new sequential importance sampling method and its application to the two-dimensional hydrophobic–hydrophilic model

The sequential importance sampling method and its various modifications have been developed intensively and used effectively in diverse research areas ranging from polymer simulation to signal processing and statistical inference. We propose a new variant of the method, sequential importance sampling with pilot-exploration resampling ~SISPER!, and demonstrate its successful application in foldi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Bioinformatics

دوره 21 12  شماره 

صفحات  -

تاریخ انتشار 2005